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ALPHA-DIVERSITY PROCESSES AND NORMALIZED 
INVERSE-GAUSSIAN DIFFUSIONS 
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University of Torino, University of Kent and University of Torino 

The infinitely-many-neutral-alleles model has recently been ex- 
tended to a class of diffusion processes associated with Gibbs par- 
titions of two-parameter Poisson-Dirichlet type. This paper intro- 
duces a family of infinite-dimensional diffusions associated with a 
different subclass of Gibbs partitions, induced by normalized inverse- 
Gaussian random probability measures. Such diffusions describe the 
evolution of the frequencies of infinitely-many types together with 
the dynamics of the time-varying mutation rate, which is driven by 
an a-diversity diffusion. Constructed as a dynamic version, relative 
to this framework, of the corresponding notion for Gibbs partitions, 
the latter is explicitly derived from an underlying population model 
and shown to coincide, in a special case, with the diffusion approxi- 
mation of a critical Galton-Watson branching process. The class of 
infinite-dimensional processes is characterized in terms of its infinites- 
imal generator on an appropriate domain, and shown to be the limit 
in distribution of a certain sequence of Feller diffusions with finitely- 
many types. Moreover, a discrete representation is provided by means 
of appropriately transformed Moran-type particle processes, where 
the particles are samples from a normalized inverse-Gaussian random 
probability measure. The relationship between the limit diffusion and 
the two-parameter model is also discussed. 

1. Introduction. Considerable attention has been devoted recently to a 
class of diffusion processes which extends the infinitely-many-neutral-alleles 
model to the case of two parameters. This family takes values in the space 

(1.1) Voo = |^ = (zi,Z2,...):zi>Z2>--->0,^z, <ll, 
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namely, the closure in [0, 1]°° of the infinite-dimensional ordered simplex, 
and is characterized, for constants < a < 1 and 9 > —a, by the second 
order differential operator 

(1.2) = - ^ - ^ - 5 E(«-. + °) 8i 

i,j=l i=l 

acting on a certain dense sub-algebra of the space C(Voo) of continuous 
functions on Vqo (throughout the paper 6ij denotes Kronecker delta). The 
diffusion with operator (1.2) describes the evolution of the allelic frequen- 
cies at a particular locus in a large population subject to random genetic 
drift and mutation, where mutation is jointly driven by the parameters 
{0,a). Ethier and Kurtz (1981) characterized the corresponding process 
when a = 0, whereas the two-parameter family was introduced by Petrov 
(2009) and further investigated by Ruggiero and Walker (2009) and Feng 
and Sun (2010). The latter is known to be stationary, reversible and ergodic 
with respect to the Poisson-Dirichlet distribution with parameters {9, a). 
This was introduced by Pitman (1995) [see also Pitman (1996) and Pitman 
and Yor (1997)] and extends the Poisson-Dirichlet distribution of King- 
man (1975) as follows. Consider a random sequence (Vi, V2, . . .) obtained by 
means of the so-called stick-breaking scheme 

71-1 

(1.3) ^1 = 1^1, Vn = WnY[{l-Wi), Wi'^Betail-a,9 + ia), 

i=l 

where < a < 1 and 9 > —a. The vector (Vi, V2, . . .) is said to have the 
GEM distribution with parameters {9, a), while the vector of descending or- 
der statistics (V(i) , V(2) , . . .) is said to have the Poisson-Dirichlet distribution 
with parameters {9, a). The latter is also the law of the ranked frequencies of 
an infinite partition induced by a two-parameter Poisson-Dirichlet random 
probability measure, which generalizes the Dirichlet process introduced by 
Ferguson (1973). Two-parameter Poisson-Dirichlet models have found ap- 
plications in several fields. See, for example, the monographs by Bertoin 
(2006) for fragmentation and coalescent theory. Pitman (2006) for excursion 
theory and combinatorics, Teh and Jordan (2010) for machine learning, Lijoi 
and Priinster (2010) for Bayesian inference and Feng (2010) for population 
genetics. See also Bertoin (2008), Handa (2009) and Favaro et al. (2009). 

The Poisson-Dirichlet distribution and its two parameter extension in 
turn belong to a larger class of random discrete distributions induced by 
infinite partitions of Gibbs type. These were introduced by Gnedin and Pit- 
man (2005), and applications include fragmentation and coalescent theory 
[Bertoin (2006), McCuhagh, Pitman and Winkel (2008), Goldschmidt, Mar- 
tin and Spano (2008)], excursion theory [Pitman (2003)], statistical physics 
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[Berestycki and Pitman (2007)] and Bayesian nonparametric inference [Li- 
joi, Mena and Priinster (2005, 2007a, 2007b), Lijoi, Priinster and Walker 
(2008a)]. See Pitman (2006) for a comprehensive account. See also Griffiths 
and Spano (2007), Lijoi, Priinster and Walker (2008b) and Ho, James and 
Lau (2007). 

This paper introduces a class of infinite-dimensional diffusions associated 
with a different subclass of Gibbs-type partitions, induced by normalized 
inverse-Gaussian random probability measures. Such discrete distributions, 
recently investigated by Lijoi, Mena and Priinster (2005), are special cases 
of generalized gamma processes [Pitman (2003), Lijoi, Mena and Priinster 
(2007b)], and their intersection with two-parameter Poisson-Dirichlet mod- 
els is given by the sole case {0,a) = (0,1/2), which corresponds to a nor- 
malized stable process with parameter 1/2 [Kingman (1975)]. The class of 
diffusions studied in this paper is characterized in terms of the second order 
differential operator 



acting on a dense sub- algebra of Co([0,oo) x Voo), the space of continuous 
functions on [0,oo) x Vqo vanishing at infinity, for parameters (/3,a), with 
/3 = ar"/a, a > 0, r > and a = 1/2. By comparison with (1.2), it can 
be seen that the last two terms of (1.4) describe the time evolution of the 
frequencies of infinitely-many types. Common features between (1.2) and 
(1.4) are the variance-covariance terms Zi{5ij — zj) and the structure of the 
drift or mutation terms —[{f3/s)zi + a]. The distinctive feature with respect 
to (1.2) is given by the fact that the positive coefficient Of = P/St varies 
in time, and is driven by what is termed here a-diversity diffusion, whose 
operator is given by the first two terms of (1.4). Equivalently, St follows the 
stochastic differential equation 



where Bt is a standard Brownian motion. This can be seen as a particular 
instance of a continuous-time analog of the notion of a-diversity, introduced 
by Pitman (2003) for Poisson-Kingman models, which include Gibbs-type 
partitions. An exchangeable random partition of N is said to have a-diversity 
S if and only if there exists a random variable S, with Q < S < oo almost 
surely, such that, as n — )• oo. 




(1.4) 




(1.5) 





(1.6) 




a.s. 
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where Kn is the number of classes of the partition of {1, . . . ,n}. The con- 
nection between (1.5) and (1.6) wih become clear in Section 4, where the 
a-diversity diffusion will be explicitly derived. 

It is to be noted that (1.2) is not a special case of (1.4). Indeed, the 
only way of making 9t = (i/ St constant is to impose null drift and volatility 
in (1.5), which implies 0^ = 0. Hence, consistently with the above recalled 
relation between normalized inverse- Gaussian and Poisson-Dirichlet random 
measures, (1.2) and (1.4) share only the case (0t,a) = (0, 1/2). Nonetheless, 
an interesting connection between these classes of diffusions can be stated. In 
particular, it will be shown that performing the same conditioning operation 
in a pre-limit particle construction of normalized inverse-Gaussian diffusions 
yields a particular instance of the two-parameter model. 

The paper is organized as follows. Section 2 recalls all relevant definitions, 
among which are Gibbs-type partitions, the associated generalized Polya-urn 
scheme and random probability measures of generalized gamma and normal- 
ized inverse-Gaussian types. Section 3 derives some new results on gener- 
alized gamma processes which are crucial for the construction. These are 
concerned with the convergence of the number of species represented only 
once in the observed sample and with the second order approximation of 
the weights of the generalized Polya-urn scheme associated with normalized 
inverse- Gaussian processes. In Section 4, by postulating simple population 
dynamics underlying the time change of the species frequencies, we derive 
the a-diversity diffusion for the normalized inverse-Gaussian case, by means 
of a time-varying analog of (1.6) with the limit intended in distribution, 
and highlight its main properties. In Section 5 normalized inverse-Gaussian 
diffusions are characterized in terms of the operator (1.4), whose closure is 
shown to generate a Feller semigroup on Co([0,oo) x Vqo), and the associ- 
ated family of processes is shown to be the limit in distribution of certain 
Feller diffusions with finitely-many types. Section 6 provides a discrete rep- 
resentation of normalized inverse- Gaussian diffusions, which are obtained as 
limits in distribution of certain appropriately transformed Moran-type par- 
ticle processes which model individuals explicitly, jointly with the varying 
population heterogeneity. Finally, Section 7 shows that conditioning on the 
a-diversity process to be constant, that is, St = s, in a pre-limit version of 
the particle construction yields, in the limit, the two-parameter model (1.2) 
with (e,a) = (sV4,l/2). 

2. Preliminaries. The Poisson-Dirichlet distribution and its two param- 
eter extension belong to the class of random discrete distributions induced by 
infinite partitions of Gibbs type, introduced by Gnedin and Pitman (2005). 
An exchangeable random partition of the set of natural numbers is said 
to have Gibbs form if for any \ <k<n and any (ni,...,refc) such that 

nj G {1, . . . , n}, for j = 1, . . . , k, and rij = n, the law 11^"^ of the parti- 
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tion (ni, . . . ,nfe) can be written as the product 

k 

(2.1) nj^"^(ni, . . . , nfc) = Vn,kJ[{l - a)n,-i. 

i=i 

Here < a < 1, 

(2.2) (a)o = l, (a)„, = a(a + l)---(a + m-l), m > 1, 

is the Pochhammer symbol and the coefficients {Ki,fc '-k = 1, . . . ,n;n > 1} 
satisfy the recursive equation 

(2.3) = (n - aA;)K+i,A.- + K+i,fc+i- 

The law of an exchangeable partition is uniquely determined by the func- 
tion n^"^(ni, . . . ,nfc), called the exchangeable partition probability function, 
which satisfies certain consistency conditions, which imply invariance under 
permutations of {1, . . . ,n} and coherent marginalization over the (n + l)th 
item. Hence, the law of a Gibbs partition is uniquely determined by the 
family {Vn^k : A: = 1, . . . , n; n > 1}. Furthermore, a random discrete probabil- 
ity measure governing a sequence of exchangeable observations is said to 
be of a Gibbs type if it induces a partition which can be expressed as in 
(2.1). These have associated predictive distributions which generalize the 
Blackwell and MacQueen (1973) Polya-urn scheme to 

(2.4) 

= go{n,Kn)vo{-) + 9i{n,Kn)^{nj - a)5x*{-), 

i=i 

where i^q is a nonatomic probability measure, Xj*, . . . are the Kn dis- 

tinct values observed in Xi, . . . ,Xn with absolute frequencies ni, . . . ,nK„, 
and the coefficients g^ and gi are given by 

(2.5) go{n,k) = —-- , g^[n^k) = — 

Vn,k Vn,k 

with {Vn^k - k = 1, . . . ,n;n>l} as above. It will be of later use to note that 
integrating both sides of (2.4) yields 

(2.6) go{n,Kn) + {n - aKn)gi{n, Kn) = 1, 

also obtained from (2.3) and (2.5). Examples of Gibbs-type random prob- 
ability measures are the Dirichlet process [Ferguson (1973)], obtained, for 
example, from (2.4) by setting ^ > and a = in 

. 6 + ak 1 

(2-7 go[n,k) = ——, g^{n,k) = -——, 

(J + n + n 

the two-parameter Poisson-Dirichlet process [Pitman (1995, 1996)], obtained 
from (2.7) with < a < 1 and 9 > —a, the normalized stable process [King- 
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man (1975)], obtained from (2.7) with < a < 1 and ^ = 0, the normalized 
inverse-Gaussian process [Lijoi, Mena and Priinster (2005)] and the normal- 
ized generahzed gamma process [Pitman (2003), Lijoi, Mena and Priinster 
(2007b)]. See also Gnedin (2010) for a Gibbs-type model with finitely-many 
types. 

The normalized generalized gamma process is a random probability mea- 
sure with representation 

oo 

(2.8) ^i = Y,P^Sx„ 

i=l 

whose weights {Pi,i £ N} are obtained by means of the normalization 

J,. 



(2-9) Pi — ^CXD 7 5 

where {Ji,i G N} are the points of a generalized gamma process, introduced 
by Brix (1999). This is obtained from a Poisson random process on [0,oo) 
with mean intensity 

A(ds) = — ^ — -exp(-rs)s-(^+")ds, s > 0, 
1 (1 — a) 

with < a < 1 and r > 0, so that if A^(^) is the number of Jj's which fall 
in yl G =^([0,oo)), then N{A) is Poisson distributed with mean A(^). Lijoi, 
Mena and Priinster (2007b) showed that a generalized gamma random mea- 
sure defined via (2.8) and (2.9), denoted by GG(/3, a), where /3 = ar"/a with 
a > and r > 0, induces a random partition of Gibbs type with coefficients 
goin,Kn) and gi{n,Kn) in (2.4) given by 

. aEr=o(")(-ir/^^/"r(fc + l-z/a;/3) 

^ nEL-o^(Y)(-W/"r(fc-V«;/3)' 

(2.10) 

i:r=o(:)(-ir/3^/°r(fc-z/a;/j) 
' nEr=-o^(Y)(-l)^/3^/"r(A:-i/a;/3)' 
where T{c;x) denotes the upper incomplete gamma function 

POO 

(2.11) r(c;x)= / s'^-^exp(-s)ds. 

J X 

Special cases of a generalized gamma process with parameters (/3, a) are the 
Dirichlet process, obtained by letting r = 1 and a — )■ 0, the normalized stable 
process, obtained by setting /? = 0, and the normalized inverse-Gaussian 
process, obtained by setting a = 1/2. 

We conclude the section with a brief discussion of the interpretation of 
a in the context of species sampling with Gibbs-type partitions. Suppose 
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Kn different species liave been observed in tlie first n samples from (2.4). 
The probability that a further sample is an already observed species is 
gi{n,Kn){n — aKn), but this mass is not allocated proportionally to the 
current frequencies. The ratio of probabilities assigned to any pair of species 



n,- — a 



a 



When a —7- 0, the probability of sampling species i is proportional to the 
absolute frequency rij. However, since for rii > rij, (n^ — a)/{nj — a) is in- 
creasing in Q, a value of a > reallocates some probability mass from type 
j to type i, so that, for example, for nj = 2 and nj = 1 we have rij = 2, 3, 5 
for q; = 0, 0.5, 0.75, respectively. Thus, a has a reinforcement effect on those 
species that have higher frequency. See Lijoi, Mena and Priinster (2007b) 
for a more detailed treatment of this aspect. 

3. Some results on generalized gamma random measures. In this sec- 
tion we investigate some properties of generalized gamma random measures 
which will be used in the subsequent constructions. In particular, these re- 
gard the convergence of the number of species represented only once in the 
observed sample, and the second order approximation of the weights of the 
generalized Polya-urn scheme associated with normalized inverse- Gaussian 
processes. 

Let Xi , . . . , Xn be an n-sized sample drawn from a generalized gamma 
process with parameters (/3,a), let Kn denote the number of distinct species 
observed in the sample, and let N„ := {Ni, . . . , Nk^) denote the vector of 
absolute frequencies associated with each observed species. The probability 
distribution of the random variable {Kn, N„), for any n>l, k = 1, . . . ,n and 
frequencies (ni,...,nfc) such that X^j!^^ ^^^ = ''^j is provided by Lijoi, Mena 
and Priinster (2007b) and coincides with 

_a^-Vn,ti(l-a)(n,-i)^Vn-l\, s 



■E(";')(-W/"r(.-^;/3), 



r(n) 

where (1 — a)(nj-i) and r{k — s/a; f3) are as in (2.2) and (2.11), respectively. 
Denote now by Mj^n the number of species represented j times in the sample. 
Then from equation 1.52 in Pitman (2006) it follows that the distribution 
of M„ := {Mi^n, Mn,n) is given by 

P(M„ = (mi,„,...,m„,„)) 

(1 -«)(,„!) 
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X 



^E(»;')(-ir.-r(.-£;,) 

for any n > 1, k = 1, . . . ,n and vector . . . , nin^n) G ■Mn,k, where 

{n n \ 

(mi,„, . . . , rUn^n) ■ mi,n > 1. '"i,n = ^' X] = n > . 

1=1 1=1 J 

The following proposition identifies the speed of convergence of the number 
of species represented once in the sample. Denote by ^{n,k,a) the general- 
ized factorial coefficient 

(3.3) nn,k,a) = ^E(-l)' ( ("i")"' 

where ^(0,0, a) = 1 and '^(n,0,a) =0. See Charalambides [(2005), Chap- 
ter 2] for a complete account. 

Proposition 3.1. Under the normalized generalized gamma process with 
parameters (/3,a), one has 

P(A-/i,„ = mi,„) 

(3.4) X "'f'Vay'"-""'"-^' + 



n-mi^n-j 

X 

fe=0 



'^(n - mi,„ - j, fc, a) 



\ a 

Moreover, 

(3.5) >aSa a.s., 

where Sa is a strictly positive and almost surely finite random variable with 
density function 



gsAs;a,/3) = e^-W^^' 



/./(s-V";a) 



with f{-;a) being the density of a positive stable random variable with pa- 
rameter a. 
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Proof. Denote {x)[jn] = ^ix — 1) • • • {x — m + 1). From (3.2), for any 
r > 1 one has 

lE[(Mi,„)[r.]] 

" ,a^A/(l-a){i-i)\'"- 1 , . 



k = lMr,,k J = l 

n-1 

X 



E(";')(-i)''*"°r('=-i;/3) 



n-1 

X 

s=0 

n 



E(";')(-i)''*"t('=-^;/3) 



E(":')(-i)'''""r(t-^;4 



n-1 

X 

s=0 

In particular, by using the definition of generalized factorial coefficient in 
terms of sum over the set of partitions M.n,k [see Charalambides (2005), 
equation 2.62], we have 



M„_r,fe_r j = l 

Therefore, we obtain 

E[(Mi,„)[,]] 

(3.6 => -— "iin-r.k-r.a) 

'-^ V (n) 

k=i ^ ' 

n-1 

X 

s=0 

In order to obtain the distribution of the random variable Mi^„, we can make 
use of the probability generating function of Mi^„, denoted Gt^Ux n)(*)- From 



E(":')(-ir/3'"T(*.-i;^), 
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(3.6) we have 

^ Q"~'e^(n)[,] 

r=0 ^ ^ 



n-1 

X 

s=0 



^ ^(n - r, /t, a)r ffc + r 

! — n \ OL J T . 



X 

fc=0 



Therefore, the distribution of Afi^„ is given by 
1 ga™^-+^--^e'^(n)[^,,„+,] 



^ r(n) 



n 

^"^(n - mi^n-j,k,a) 



n-l 

X 

=0 

n 

X 

fe=0 



xr /c + mi,„+j--;/3' ^ ' 



1 ga™^-+^--^e/^(n)[^,_„+,] 



mi,„! r(n 



s=0 ^ ^ 

n-mi,n-J 

x ^ 'if(n - mi,„ - j, A:,a) 



it=i 



a 



x^/'t ii / ^ s OS/a 



r(n)mi n! 



a" y j! 

E(";')(-ir/i 



X ^ (-a)^- 
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n-m\,ri~] 

fc=o 

X T[k + mi^n + j - 
\ a 

where the last identity is due to the fact that ^(n, = for any k > n. 
Proposition 3 in Lijoi, Mena and Priinster (2007b) shows that 

(3.7) Kjn^ ^ 5„ 

almost surely, where 5^ is an almost surely positive and finite random vari- 
able with density function 

with /(•;«) being the density function of a positive stable random variable 
with parameter a. In other terms, according to Definition 3.10 in Pitman 
(2006), an exchangeable partition of N having EPPF (3.1) has a-diversity 
Sa- A simple application of Lemma 3.11 in Pitman (2006) leads to (3.5). □ 

A second aspect of generalized gamma random measures we need to ad- 
dress for later use is the approximate behavior of the coefficients in the 
generalized Polya urn (2.4). It is well known that the first order behavior of 
(2.10) is that of a normalized stable process, that is, 

(3.8) goin,k) ^ ak/n, gi{n,k)^l/n, 

also implied by the next result. However, it turns out that for the definition 
of the diffusion processes which are the object of the next two sections, it is 
crucial to know the second order approximation. The following proposition, 
whose proof is deferred to the Appendix, identifies such behavior for the 
normalized inverse-Gaussian case a = 1/2. 

Proposition 3.2. Let go{n,k) and gi{n,k) be as in (2.10). When a = 
1/2, 

5o(n,fc) = ^ + ^ + o(n-i) 
n n 



1 P/Sn , , _2n 



and 

(3.9) gi{n,k) = ^-^ + o{n--'), 

where Sn = kjn^ and /3 = or" /a. 



4. Alpha-diversity processes. Making use of the results of the previous 
section, here we construct a one-dimensional diffusion process which can 
be seen as a dynamic version of the notion of a-diversity, recalled in (1.6), 
relative to the case of normalized inverse-Gaussian random probability mea- 
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sures. Such diffusion, which will be crucial for the construction of Section 5, 
is obtained as weak limit of an appropriately rescaled random walk on the 
integers, whose dynamics are driven by an underlying population process. 
This is briefly outlined here and will be formalized in Section 6. Consider 
n particles, denoted rr^") = {xi, . . . ,Xn) with rcj G X for each i, where X is 
a Polish space, and denote by Kn = Knix^"^^) the number of distinct val- 
ues observed in {xi, . . . Let the vector {xi, . . . ,Xn) be updated at dis- 
crete times by replacing a uniformly chosen coordinate. Conditionally on 
Js:„(xW) = A;, the" incoming particle will be a copy of one still in the vector, 
after the removal, with probability gi{n — l,kr), and will be a new value with 
probability go{n — 1, kr), where gi{n — 1, k) and go{n — 1, k) are as in (2.10) 
and kr is the value of k after the removal. Denote by {Kn{m),m £ No} the 
chain which keeps track of the number of distinct types in (xi, . . . , Then, 
letting mi^n be the number of clusters of size one in (xi, . . . ,Xn), which, by 
means of (3.5) and (3.7) is approximately ak for large n, the transition 
probabilities for Kn{m), 

p{k, k') = P{Kn(m + 1) = k'\Kn{m) = k} 

are asymptotically equivalent to 

/ ak\ 

(l \go{n-l,k), ifl<k<n,k' = k + l, 

ak 

— gi{n — 1, /c — l)(n — 1 — aik — 1)), 
n 

1 < k <n,k' = k - 1, 
l-p{k,k + l) -p{k,k-l), iik' = k, 
0, else 
for 1 < k <n. That is, with probability 

^i,n/^ ~ ak I Ti a cluster of size one 
is selected and removed, with probability gQ{n — 1, A;) a new species appears 
and with probability gi (n — 1, A;)(n — 1 — a{k — 1)) a survivor has an offspring. 
Note that k = l and k = n are set to be barriers, to render the fact that mi^„ 
equals and n when k equals 1 and n, respectively. 

The following theorem finds the conditions under which the rescaled chain 
Kn{m)/n°' converges to a diffusion process on [0,oo). Here we provide a 
sketch of the proof with the aim of favoring the intuition. The formalization 
of the result is contained in the proof of Theorem 6.1, while that of the fact 
that the limiting diffusion is well defined, that is, the corresponding operator 
generates a Feller semigroup on an appropriate subspace of C([0,oo)), is 
provided in Corollary 4.1 below. 

Throughout the paper CsiA) denotes the space of continuous functions 
from A to B, while X„ =^ X denotes convergence in distribution. 

Theorem 4.1. Let {Kn{'iTi),m G No} be a Markov chain with transi- 
tion probabilities as in (4-1) determined by a generalized gamma process 



(4.1) p{k,k') 
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with (3 >0 and a = 1/2, and define {Kn{t),t > 0} to be such that Kn{t) = 
Kn{l'n^^'^t\)/n" . Let also {St,t>0} be a diffusion process driven by the 
stochastic differential equation 



(4.2) 



dSt = ^<lt + ^/StdBt, 5t>0, 



where Bt is a standard Brownian motion. If Kn{0) =^ ^o, then 
(4.3) {Kn{t),t>0}^{St,t>0} in qo,oo)([0,oo)) as n^oo. 

Proof. Let a = 1/2. From Proposition 3.2 we can write (4.1) as follows 
(for ease of presentation we use n and k in place of n — 1 and k — 1 since it 
is asymptotically equivalent): 

ak\ / ak P/sr, 



p{k,k') 



n 



1 — + 



n 



n 



+ o(n 1), 



-(--^]in-ak)+oir,-y% if 1 

1 - p{k, k + 1)- p{k, k-l) + o(n"^), if k' = k, 
k 0, else. 



if l<k <n,k' = k + 1, 



< k < n,k' = k - 1, 



The conditional expected increment of the process {Kn{m)/n°' ,m £ No} is 
k 



^, k' k 
E{ 



(4.4) 



n" n' 
1 



^ ak\ / ak /3/s„, \ ak f 1 fijsn 



n J \ n 



n 



n \n 



(n — ak) 



+ o 



l+a 



n 



l+a 



n 



+ 



n 



l+a 



Similarly, the conditional second moment of the increment is 

2 



E 



(4.5) 



k' k 
_ 1 

+ o 
2ak 



ak\ / oA; _^ l3/sn \ _ P/sn \ ^ 

n J \ n n J n \n v? J 



n — ak) 



1 



n 



l+2a 



n 



l+2a 



+ 



1 



n 



l+2a 
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Since k ~ sn", and recalling that s almost surely, we have 

k\^P/s 



n 



and 



n 



l+a 



E 



k' 



2as. 



It is easy to check that all conditional mth moments of /Skju^ converge 
to zero for m > 3, whence it follows by standard theory [cf., e.g., Karlin 
and Taylor (1981)] that, as n — )■ oo, the process Kn{t) = Kn{ln^'''^t\)/n°' 
converges in distribution to a diffusion process St on [0,oo) with drift 13 /St 
and diffusion coefficient \/2aSt- □ 



As anticipated, the second order approximation of gQ{n,k) is crucial for 
establishing the drift of the limiting diffusion, as the first order terms can- 
cel. It is interesting to note that when /? = 0, which yields the normalized 
stable case, the limiting diffusion reduces to the diffusion approximation of 
a critical Galton-Watson branching process, also known as the zero-drift 
Feller diffusion. See, for example, Ethier and Kurtz (1986), Theorem 9.1.3. 
This also holds approximately for high values of St, in which case the drift 
becomes negligible. 

In order to have some heuristics on the behavior of the a-diversity process. 
Figure 1 shows 3 x 10^ steps of the random walk {i^„(m)/n°,m G No} with 
dynamics as in Theorem 4.1, starting from 1/y/n with n = 200. The three 
paths correspond to /3 being equal to 0, 100 and 1000. It is apparent how /3 
influences the dynamic clustering structure in the population. 

It is well known that when /3 = 0, the point is an absorbing boundary 
for St- The next result provides the boundary classification, using Feller's 
terminology, for the case /? > 0. 

Proposition 4.1. Let St be as in Theorem 4-1 with /3 > 0. Then the 
points and oo are, respectively, an entrance and a natural boundary. 



Proof. The scale function for the process, defined as 
(4.6) S{x)= I s{y)dy, 0<x<oo, 



where 

a^{t) 



s(y)=exp<' - C^^dt 



yo 
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Fig. 1. Three sample paths of the random walk {Kn(m) /n°' ,m £ No}, with dynamics as 
in Theorem 4-1, starting from with n = 200, for parameter values: (a) /3 = 0, (b) 

/3 = 100, (c) P — 1000. The figures show how /3 influences the dynamic clustering structure 
in the population. 
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.2/ 



and iJ,{x) and a {x) denote drift and diffusion, equals 

S{x)-- 



-2/3/j/o \x(,'2l3/x 



xoe 



2/3/xo 



2/3Ei(2/3/x)+Ei(2^/xo)], 



where Ei(z) is tlie exponential integral 



Ei(z) 



t-^e"* dt. 



Letting S[a, b] = S{b) — S{a), for < a < 6 < oo, we have 
(4.7) 



^(O, b] = lim5[a, b] = oo, 



S[a, oo) = lim S[a, b] = oo. 



Moreover, the speed measure is given by 

rd 

M[c,d]= / [a^{t)s{t)]-^dt 



=2/3/s/o 



[Ei(-2/3/c) - Ei(-2/3/d)] 



from which M(0, d] = limc4.o M[c, d] < oo and 



(4.8) 

Now (4.7) implies that 



M[c, oo) = lim M[c, d] = oo. 



S(0) = lim / S{1, y] dM{y) = oo, 
iio Ji 



S(oo) = lim / S[y, r) dM{y) = oo 

/too.' 



and (4.8) implies 



while 



iV(oo) = lim / S[x, y] dM (y) 

rtoo 



oo. 



iV(0) = lim / S[y,x]dM{y) 
'4-0 Ji 



lim 



xe^^x _ye^P/y ^2p[ Ei( ^ ) - Ei( ^ ) 



2^ 

y 



2i\ 

X j 



dy < oo 
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since 



lim Ei( — ) < oo. 

yi^ y \y J 

The statement now follows from, for example, Karlin and Taylor (1981), 
Sectfon 15.6. □ 

Hence, when /3 > neither boundary point is attainable from the interior 
of the state space, from which the actual state space is [0,oo) for {Stit > 0} 
and (0,oo) for {St,t > 0}. The process can be made to start at 0, in which 
case it instantly moves toward the interior of the state space and never 
comes back. Consequently, we will use (0,oo) or [0,oo) as state space at 
convenience, with the agreement that (0,oo) is referred to {St^t > 0}. 

As a corollary, we formalize the well-definedness of the a-diversity diffu- 
sion. Denote by Co [K) the space of continuous functions vanishing at infinity 
on a locally compact set K, and let || • || be a norm which makes Cq{K) a 
Banach space. Recall that a Feller semigroup on Cq{K) is a one-parameter 
family of bounded linear operators {T{t),t > 0} on Co{K) such that T{t) 
has the semigroup property T{s + t) = T{s)T{t) for all s,t > 0, is strongly 
continuous, that is, 

\\T{t)f-f\\^0 asi^O,/GCo(K), 

and, for all t > 0, T{t) is a contraction, that is, < 1, is conservative 

in the sense that T{t)l = 1, and is positive in the sense that it preserves the 
cone of nonnegative functions. 

Corollary 4.1. For /3 > 0, let Aq be the second order differential op- 
erator 

. d 1 d^ 

'"° = 7dl + rdi^ 

and define 

(4.10) &{Ao) = {/ G Co([0,oo)) n C\{0, oo)) : A/ G Co([0, oo))}. 
Then {{f,Aof) : f £ ^{Aq)} generates a Feller semigroup on Co([0, oo)). 

Proof. The result follows from Proposition 4.1 together with Corol- 
lary 8.1.2 in Ethier and Kurtz (1986). □ 

An immediate question that arises is whether the a-diversity diffusion is 
stationary. The following proposition, which concludes the section, provides 
a negative answer. 

Proposition 4.2. Let {St,t>0} be as in Theorem 4-1- Then there 
exists no stationary density for the process. 
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Proof. A stationary density, if it exists, is given by 

iP{x)=m{x)[CiS{x) + C2], x>0, 

where m{x) = [s{x)a'^{x)]~^ , s(x) and S{x) are as in (4.6), and Ci,C2 are 
constants determined in order to guarantee the nonnegativity and integra- 
bility to one of ip. Here s{x) = e~'^^l^ and 

5(x)=xe2/5/^-2/3Ei(2/3/a;) 

so that 

^{x) = Ci- 2/3Cix-ie-2^/^ Ei(2/3/x) + Cax'^e-^^/^ 

The second term is not integrable in a neighborhood of infinity, since there 
exists an xo > such that 

Ei(2/3/x) > x"^ for all x > xq, 

hence Ci must be zero. Since neither the third term is integrable, this gives 
the result. □ 



5. Normalized inverse-Gaussian diffusions. The a-diversity process con- 
structed in the previous section is a key component in the definition of 
the class of normalized inverse- Gaussian diffusions. In this section we char- 
acterize such infinite-dimensional processes in terms of their infinitesimal 
generator, and show that they can be obtained as the limit in distribution 
of a certain sequence of Feller diffusions with finitely- many types. The as- 
sociation of the limit family with the class of normalized inverse- Gaussian 
random probability measures will instead be shown in Section 6. 

Consider the (n — l)-dimensional simplex 

An= [0,l]":z,>0,^z, = l| 
and the closed subspace of A„ given by 

A„ = jzG [0,l]":zi>e„,^Zi = l|, 

so that £n ^ Zi < 1 — {n — l)e„ for Zj G A„, where {Sn} C ]R_|. is a nonincreas- 
ing sequence such that 

(5.1) 0<en<- Vn>2, ne„|0. 

n 

Define, for {zo,zi, . . . , z„,) G (0, oo) x A„, the differential operator 
"^""2 2^ dz, dz, ^2^^' ^ ' dz, ' 
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where the covariance components (a."^(z))j^j=o,...,n are set to be 

20, i = j = 0, 

[zi - £n){5ij{l - nsn) - {Zj - 1 < < n, 

else, 

and, for /3 > 0, the drift components are 

b't\z) = ^^{l-z.)-lz, 
zo(n-l) zq 




a(l - exp{-(2:i - e„)e^/^"}), i = l,...,n. 



Observe that a\j\z), for 1 < j < n, can be seen as a Wright -Fisher type 
covariance restricted to [e„, 1 — (n — l)e„,]", since for such indices i,j 

(5.2) a^{z) = I - - - l^'- - ^ 

^ {-{Zi-en){zj - En), i j , 

and that the first two terms in bf''\z), i = 1, . . . ,n, equal 

-(1 - (n - l)e„ - Zj) (zj - En) 



zo(n-l) zo 

from which the behavior at the boundary is clear. For ease of exposition and 
in analogy with the previous section, whenever convenient we will denote zq 
by s, so that for instance An can be written more explicitly, 

s 2 9s"^ 2 ^ dzidzj 2 ^ * 

i,j = l 4=1 

with a^J^(z) and 6j-"'*(z) as above. The domain of An is taken to be 

(5.4) &{An) = {/:/ = /o X /i, /o G &{Ao)Jl G C\Kn)}, 

where (/o x /i)(s,z) = /o(s)/i(z), ^(A) is (4.10), and 

C\Kn) = {/ G C(A„) : 3/ G C2(M"), = /}. 

The operator An drives n + 1 components: those labeled from 1 to n can 
be seen as the frequencies associated to n species in a large population, 
bounded from below by e^; the zq or s component is a positive real variable 
which evolves independently according to the a-diversity diffusion (4.2) and 
contributes to drive the drift of the other n components. 

Denote by Co([0,oo) x A„) the Banach space of continuous functions on 
[0, oo) X A„ which vanish at infinity, equipped with the supremum norm 
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= sup^.gjQ oo)xA„ /(^)' ^^'^ ^{B) the set of Borel probability mea- 
sures on B. Recall that a Markov process {X{t),t > 0}, taking values in a 
metric space E, is said to correspond to a semigroup {T{t)}, acting on a 
closed subspace L of the space of bounded functions on E, if 

E[/(X(t + = T{s)f{X{t)), i > 0, 

for every / G L, where = a{X{u),u <t). 

Proposition 5.1. Let An he the operator defined in (5.3) and (5.4)- 
The closure in Co([0, oo) x A„) of An generates a strongly continuous, pos- 
itive, conservative, contraction semigroup on Co([0,oo) x A.„). For 
every Vn € =^^([0, oo) x A„) there exists a strong Markov process Z^'^\-) = 
{Z^'^\t),t > 0} corresponding to with initial distribution f„ and 
sample paths in C'[ooo)xA (P'^)) ^^^^ probability one. 

Proof. We proceed by verifying the hypothesis of the Hille-Yosida the- 
orem. Note first that An satisfies the positive maximum principle, that 
is, for / G Si{An) and G (0,oo) x A„ such that ||/|| = /(s*,z*) > 

we have < 0. Indeed, writing An = + An,i to indicate the 

first two and last two terms in (5.3), it is immediate to check that 
and An,i satisfy the positive maximum principle on [0,oo) and A„, respec- 
tively. If /o(s*) > 0,/i(2:*) > 0, then ^o/o(s) < and An,ih{z) < 0, while if 
fo{s*) < 0,/i(z*) < 0, then Aofo{s^) > and ^n,i/i(-z*) > 0. In both cases 

Anfis*,Z*) = /l(z*)A/o(s*) + /o(s*)^n,l/l(2*) < 0. 

Let now L C S!{An) be the algebra generated by functions / = /o x /i, 
with /o G ^(A), ^(A) as in (4.10) and h = z^ = z^' ■ ■ ■ z^^" G C2(A„), 
Cj G No, so that L is dense in Co([0,oo) x A„), and so is !^{An)- Denoting 
c + = (co, . . . , Cj + d, . . . , Cn), for / G L we have 

Anifois) X Z^) 



fi{z)Aof{s) 



(5.5) 



Ci{c, - l)(-z'= + (1 - (n - 2)e„)e„z'^-iw 
-e„(l-(n-l)e„>^-2(0) 

n 

+ 5^ Qc,-(-z^ + e„(z'=-i(^) + z'^-iw 

-EnZ'^-^W-^t^))) 
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" ^ (3 



Ci 
i=l 



Zo{n - 1) 



so that the image of An contains functions of type /o x and /o x e~^°^'z'^, 
with 5o fixed. For every ^(x) G C(-ftr), with compact, and /(x) = ^°^g{x) € 
C{K), there exists a sequence {p*-'^-*} of polynomials on ii' such that ||/ — 
II 0, so that ||e~*°^p^^) — (7II — 0. It follows that the image of An is dense 
in Co([0, 00) X A„), and so is that of A — An for all but at most countably 
many A > 0. The first assertion now follows from Theorem 4.2.2 of Ethier 
and Kurtz (1986) and by noting that 1 G &{An) and Anl = 0, that is, An 
is conservative. The second assertion with -D^q ^^^^^ ([0, 00)), the space of 
right-continuous functions with left limits, in place of C'lo oo)xA„([^'°'^))' 
follows from Theorem 4.2.7 of Ethier and Kurtz (1986). To prove the al- 
most sure continuity of sample paths, it is enough to show that for every 
z* G (0,00) X An and e > there exists a function / E &{An) such that 

/(Z*) = 11/11, sup /(Z)</(Z*), Anfiz*)=0, 

where B{z* ^eY is the complement of an e-neighborhood of z* in the topology 
of coordinatewise convergence [cf. Ethier and Kurtz (1986), Remark 4.2.10]. 
This can be done by means of a function / S &{An) which is flat in z* 
and rapidly decreasing away from z* , for example, of type f{z) = ci — 
C2 X^iLo(-2i ~ for appropriate constants ci,C2. □ 

For Z(")(-) as in Proposition 5.1, consider now the mapping pn{Z^"'\-)), 
where pn : [0, 00) x A„ — )• [0, 00) x Vqo is defined as 

(5.6) Pn{z) = (2;o,Z(i),2;(2),...,Z(n),0,0,...), 

{z(^i-^, . . . , Zf^n)) is the vector of decreasingly ordered statistics of (zi,...,z„) G 
An, and Voo is the closure of the infinite-dimensional ordered simplex, de- 
fined in (1.1). The following proposition states that pn{Z^"'\-)) is still a 
well-defined Markov process. Define 

V„ = |z G Voo-Zn >£n> Zn+1 =0,^^^; = l| 

and observe that z G Voo satisfies < 1/i for all i, so that Vn is nonempty 
by (5.1). 
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Proposition 5.2. Let An be defined by the right-hand side of (5.3), 
with domain 

where &{Ao) is as in (4-10) and 

CpJVn) = {/ G C(V„) : / o p„ G CHAn)}. 

Then the closure of An in Co([0,oo) x V^) generates a strongly continuous, 
positive, conservative, contraction semigroup {Tn{t)} on Co([0,oo) x V^). 
For every S ^([0,oo) x A„), let Z^") be as in Proposition 5.1. Then 
Pn{Z^'^\-)) is a strong Markov process corresponding to {Tn{t)} with initial 
distribution i^n° Pn^ ctnd sample paths in C'[ooo)xV ([^'°'^)) with probability 
one. 

Proof. The result follows from Proposition 2.4 in Ethier and Kurtz 
(1981), with A„,, V„ and p„ there substituted by [0,oo) x A„, [0,oo) x V„ 
and (5.6), respectively. □ 

We now turn the attention to the limit of pn{Z^'^\-)) when the number of 
types goes to infinity. To this end, consider that Vqo is a compact and metriz- 
able space in the topology of coordinatewise convergence, and let Co([0, oo) x 
Voo) be the Banach space of continuous functions on [0, oo) x Vqo which van- 
ish at infinity, with the supremum norm ||/|| = sup^^jQ oo^xVoo 1/(^)1- '^^^ 
issue for showing that the closure of the differential operator A, defined in 
(1.4), generates a Feller diffusion on Co([0, oo) x Voo) is the choice of the do- 
main of A. Here we adapt to the present framework a technique indicated by 
Ethier and Kurtz (1981). Consider polynomials (pm : Voo — ^ [0, 1] defined as 

oo 

(5.7) 'Pi{z) = l, (^„,(z) = J^zf, m>2. 

1=1 

Since z £ Voo implies < functions (pm with m > 2 are uniformly con- 
vergent, and sums in (1.4) are assumed to be computed on 

(5.8) Voo = f^z = {zi,Z2,...):zi>Z2>--->0,Y,Zi = l^ 

and extended to Voo by continuity, so that, for example, 

°° d 

^(1 + Zi) Q-Mz) = 2 + 2ip2{z) 
i=i 

instead of 2^^-^ Zi + 2ip2{z). Write 

(5.9) A = Ao + Ai 

to indicate the first two and last two terms in (1.4), and denote 

(5.10) &{Ai) = {sub-algebra of C(Voo) generated by as in (5.7)}. 
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The domain S^{A) of the operator (1.4) is then taken to be 

S^{A) = {sub-algebra of Co([0, oo) x Voo) generated by 

(5.11) 

/ = /ox/i:/oGi^(A),/iG^(^i)} 
with Si{Aq) as in (4.10) and ^(Ai) as above. 

Lemma 5.1. The sub-algebra &{Ai) C C(Voo) is dense in C(Voo). 

Proof. See the proof of Theorem 2.5 in Ethier and Kurtz (1981). □ 

We also need the following lemma, which shows that the operator Ai is 
triangulizable. 

Lemma 5.2. Let Ai be as in (5.9) and, for any m>2, let L^a be the 
algebra generated by polynomials as in (5.7), with degree not greater than 
m. Then Ai:Lm.^ Lm- 

Proof. The assertion follows from equation (2.4) in Feng and Sun 
(2010), with e replaced by /3/s. □ 

Then we have the following result. 

Theorem 5.1. Let A be the operator defined by (I.4) and (5.11). The 
closure in Co([0,oo) x Voo) of A generates a strongly continuous, positive, 
conservative, contraction semigroup {T{t)} on Co([0,oo) x Voo)- For every 
V G =^^([0, 00) X Voo), there exists a strong Markov process Z{-) corresponding 
to {T{t)} with initial distribution u and sample paths in C[o oo)xVoo ([^' 
with probability one. 

Proof. For every g G Co([0,oo) x Voo), define r„:Co([0,oo) x Voo) — > 
Co([0,oo) X V„) to be the bounded linear map 



rng = g\ 



[0,oo)x V„ 



given by the restriction of g to [0,oo) x V„. Note that r„ : ^{A) — > &{An), 
with An as in Proposition 5.2, and that 

(5.12) ||r„ff- 511^0, gGC7o([0,oo) X Voo). 

Then, for g G &{A) and z G (0, 00) x V„, we have 

\Anrng{z) - rnAg{z)\ 

{n), . d'^g{z) 



dzj dzi 
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with a^^\z) as in (5.2). In particular, 

_ ( en[{zi- en)in-l) + l- Zi], i = j, 

{ ^n[Zi + Zj — Sn], ^T^Jj 

which is bounded above by ne„, from which 
\Anrng{z) - rnAg{z)\ 



(5.13) 



d^g{z) 



dzi dzj 



+ 



(3 



zo(n-l)§ 



dg{z) 



dzi 



i=l 



dg{z) 



dzi 



For g E S>{A) of type g = f^x fi, with /o G ^(^o) and /i = ipm^ ■ • • (pm^^ , we 
have 



(5.14) 



so that 



E 



a5r(z) 



dZi 



n k 

i/o(^o)i XI Yi "^i^r'~^ n '^"^^ 
i=i j=i 



< l/o(2;o)lX]"^iX]^' 



rrii — l 



Xexp{-(zi -e„)ei/^"} 



i=l 



^^(z) 



dzi 



<ne„|/o(zo)|Xmj^O 



uniformly as ?i — )■ oo by (5.1). Furthermore, 



E 



< 1/0(2^0)1 X] dijLfm^YlLfm.^+^diipm^djLpm, JJ '^'"'^ 



\foizo)\ 
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<l/o(^o)| 



mhinih - 1) + 2 ^ inhnig 

q^h 



whose right-hand side is bounded. Since also the right-hand side of (5.14) is 
bounded above by |/o(-zo)| '^j=i^ji it follows by (5.1) that the right-hand 
side of (5.13) goes to zero uniformly and, by means of (5.12), that 

(5.15) WAnVng-AgW^O, g£&{A). 

Proposition 5.2 implies that An is a dissipative operator for every n > 1, 
so that by (5.15) A is dissipative. Moreover, Lemma 5.1 and Lemma 5.2, 
respectively, imply that &{A) and the range of \ — A, for all but at most 
countably many A > 0, are dense in Co([0,oo) x Voo). The fact that the 
closure of A generates a strongly continuous contraction semigroup {T{t)} 
on Co([0,cxd) X Voo) now follows from the Hille-Yosida theorem [see Theo- 
rem 1.2.12 in Ethier and Kurtz (1986)]. It is also immediate to check that 
Al =0, so that (1,0) G {g,Ag) and {T{t)} is conservative. Finally, (5.15), 
together with Lemma 5.1 and Theorem 1.6.1 of Ethier and Kurtz (1986), 
implies the semigroup convergence 

(5.16) \\Tn{t)rng-Tit)g\\^0, g £ Coi[0,oo) xV^), 

uniformly on bounded intervals. From Proposition 5.2, {Tn{t)} is a positive 
operator for every n > 1, so that {T{t)} is in turn positive. 

The second assertion of the theorem, with oo)xVoo(['-*'°'^)) place of 
CjQ j^^j^y^ ([0, oo)), follows from Theorem 4.2.7 in Ethier and Kurtz (1986), 
while the continuity of sample paths follows from a similar argument to that 
used in the proof of Proposition 5.1. □ 

The following corollary formalizes the convergence in distribution of the 
sequence of processes of Proposition 5.2 to the infinite-dimensional diffusion 
of Theorem 5.1. 

Corollary 5.1. Let Z(")(-) be as in Proposition 5.1 with initial distri- 
bution Un £ ^{[0,oo) X A„), and let Z{-) be as in Theorem 5.1 with initial 
distribution v G ^([0,oo) x Voo). //^^n ° Pn^ =^ u on Voo, then 

as oo. 
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Proof. The result with L>jo^^)x^^([0,oo)) mp\aceofC^f^^^^^^^{[0,oo)) 

follows from Proposition 5.2, together with (5.16) and Theorem 4.2.5 in 
Ethier and Kurtz (1986). The fact that the weak convergence holds in 
CjQ oo)x Voo ([^' °*^)) follows from relativization of the Skorohod topology. □ 

6. A population model for normalized inverse- Gaussian diffusions. By 

formalizing the population process briefly mentioned in Section 4 for con- 
structing the a-diversity diffusion, in this section we provide a discrete ap- 
proximation, based on a countable number of particles, for the diffusion 
with operator (1.4). More specifically, this is obtained as the limit in distri- 
bution of the process of frequencies of types associated with a set of particles 
sampled from a normalized inverse-Gaussian random probability measure, 
jointly with the normalized version of the diversity process. 

In view of (2.4), the conditional distribution of the ith. component of 
an exchangeable sequence {Xi, . . . ,Xn) drawn from a random probability 
measure of Gibbs type can be written 

IP{Xj G -{Xi, . . . ,Xi-i, Xi^i, . . . ,Xn} 

(6.1) = go{n- l,Kn-i,i)M-) 

+ gi{n-l,Kn-i4) ^ {uj -a)6x*{-), 
i=i 

where fo is a nonatomic probability measure and {XI , . . . , ^ , ) are the 
Kn-i^i distinct values in {Xi, . . . , Xj+i, . . . ,Xn). For fixed n, define a 
Markov chain {X^'^) {m),m> 0} on X'' by means of the transition semigroup 

Tnfix) = j f{y)pn{xAy). f G CoiX^), 

where x,y £ X", Co(X") is the space of Borel- measurable continuous func- 
tions on X" vanishing at infinity, 

1 " 

(6.2) Pnix,dy) = - '^piidyi\x{-i))Y[6xkidyk), 

Xj-.j) = (xi, . . . , Xj-i, Xj+i, . . . , x„) and pi(dy|x(_j)) is (6.1). The interpreta- 
tion is as follows. At each transition one component is selected at random 
with uniform probability, and is updated with a value sampled from (6.1), 
conditional on all other components, which are left unchanged. Hence, the 
incoming particle is either a new type (a mutant offspring) or a copy of an 
old type (a copied offspring). Embed now the chain in a pure jump Markov 
process on X" with exponentially distributed waiting times with intensity 
one, and denote the resulting process by X^^\-) = {X^'^\t),t > 0}. The in- 
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finitesimal generator oi X^'^\-) is given by 
1 " 

BrJ{x) = -Y,9t'''HP^fi^)-fi^)] 



(6.3) 



n . 
1=1 



n i^n—l.i 



n 

i=i j=i 



with domain 

(6.4) ^(i?„) = {/:/GCo(X")}. 
Here : Co{X^) Co(X"-^) is defined as 

(6.5) ^j*if{xi,.. .,Xn) = f{xi, . . . ,Xi_i,Xj,Xi+i, ...,Xn) 

for X* G {xl, . . . , x*j^^ ^ ,), P is the transition semigroup 

(6.6) Pg{z) = j g{y)pi{z,dy), ffeCo(X), 

where pi{z,diy) is given by 

(6.7) pi{z,<\y) = VQ{dy), 

Pif denotes P acting on the ith coordinate of /, and we have set for brevity 

(6.8) gf"-^''^ =gj{n-l,Kn-i,i), j = 0,l. 

Defining (6.6) and (6.7) separately is somewhat redundant, but wiU ahow us 
to provide a general expression for the global mutation rate in this particle 
representation before making the assumptions of nonatomicity and parent 
independence as in (6.7). See (A. 15) below. 
Define now the map w : X" — t- Voo by 

(6.9) wix) = u;(xW) = {zi,...,ZK„,0,0,-.-), 

where Zj and Kn, respectively, denote the relative frequency of the jth most 
abundant type and the number of types in X^"'\ Let also A be as in (1.4). 
The next theorem states that 

(6.10) [i^„(-)/n-,^(xW(.))] = {[i^„(t)/n-,u;(X(")(t))],t>0}, 

if appropriately rescaled in time, converges in distribution to the process 
with generator A. The proof is deferred to the Appendix and contains, as a 
byproduct, a more formal derivation of Theorem 4.1. 
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Theorem 6.1. Lei be the X"" -valued process with generator (6.3) 

and (6.4), w.H"^ — ^ Vqo as in (6-9) and Z{-) as in Theorem 5.1. If 

(6.11) [K„(0)/n-,u;(xW(0))] ^ Z(0), 
then 

(6.12) [Kn{n^l'^t)/n'^,w{X^''\nH/2))] Z{t) 
S,oo)xv..([0'°o))- 

We conclude the section by showing the reversibility of the particle pro- 
cess. Denote the joint distribution of an n-sized sequence from the general- 
ized Polya urn scheme (2.4) by 

^^(dxi, . . .,AXn) 



n-l 

i'o(dxi) JJ 

i=l 



Ki 

go{i, Ki)uo{dxi+i) + gi{i, Ki) '^{nj - a)^.(dxi+i) 



Proposition 6.1. Let X^'^\-) be the X"' -valued process with generator 
given by (6.3) and (6.4). Then X^^\-) is reversible with respect to 

Proof. Let qn{x,dy) denote the infinitesimal transition kernel on X" x 
,^(X") of X^'^\ Denoting by A„ the rate at which the discontinuities of X^"^^ 
occur, and recalling (6.2), we have 

.y^n{dx)qnix,dy) 



1 " 

: ^„(d3;)A„- ^Pi(d?/ik(-i)) n '^^fc 



n 

i=l k^i 



A 

: ^^.^„_i(dx(_i))pi(d2;i|2;(_i))pi(dyi|x(_i)) JJ(5j,j^(?/fc) 

A " 

: ^ J]]^„_i(dy(_j))pi(dxi|y(_i))pi(dyi|y(_i)) JJ(5j;,(xfc) 

1=1 k^i 
1 " 

: J^n{dy)-^^ \nPn{dxi\y^i) 5y^ {xk) = ^n{dy)qn{y, dx), 



n 

i=l kyti 



giving the result. □ 

7. Conditioning on the alpha-diversity. We conclude by discussing an in- 
teresting connection with the two-parameter model (1.2). In the Introduction 
it was observed that conditioning on the a-diversity diffusion St to be con- 
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stant in the operator (1.4) only yields the special case /3 = and a = 1/2, 
consistently with the associated random probability measures. It turns out 
that performing the same conditioning operation in the particle construc- 
tion of the previous section, before taking the limit for n — )■ oo, yields a 
particular instance of the two-parameter model. The following proposition 
states that under this pre- limit conditioning with 5t = s, the normalized 
inverse-Gaussian model with operator (1.4) reduces to the two-parameter 
model with {9, a) = ((as)^, a) and a = 1/2. 

Proposition 7.1. Let X'^^\-) be as in Theorem 6.1, w he as in (6.9), 
Z^"'\-) be defined by the left-hand side of (6.12), and denote by y^'"(-) the 
process with operator as in (1.2). Then 

C'[o,oo)xV^([0'°o)) asn^oo. 

Proof. Let a = 1/2 throughout the proof. In the pre-limit version of 
the process of frequencies derived from the particle process, that is, (6.9), 
conditioning on St = s means conditioning on Kn{-) being constant over 
time, hence with zero conditional first and second moment. Denote 

z — Ei = {zi, . . . , Zi-i,Zi — 1, Zi-^-l, . . .) 

and assume z has k nonnull components obtained from n particles. Then, 
as in Section 6, when a particle is removed we have the change of frequency 

Z^ Z W.p. Zi, 

n 

where z — £i/n has 



(1) : k nonnull components w.p. 1 

(2) : k — \ nonnull components w.p. 



n 



n 

Conditional on case (1), the number of nonnull components remains k if the 
incoming particle is a copy of an existing type, that is, we observe either of 

, (n,k) / -, \ 1/1 (n,k)\ 

z H>z h— W.p. '{ni-l-a) [l-g^Q '), 

nun 

, (n,k) / \ if-i {ri,k)\ 

z ^z + ^ W.p. 5^ '{nj-a) {l-gl^ 

nun 

where g^'^^ and g^'^^ are as in (6.8), while conditional on case (2) we 
observe 

z I— )-z 1 w.p. 1. 

n n n 
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For A„ = ?^^/2, the generator of the process Z^") (•) in this case can be written 



■ Um ^ 
(540 Ot 



+ /l 1 ^ - — + — 1 



n n I n 



+ {l-\M)h{z) + 0{{5tf)-h{z) 



(7.1) V„ = LGVoo:^„+i = 0,^Zi = ll, 



2 = 1 



and Voo as in (5.8). Exploiting the relation 



^ giEj=iK -«) 

n J 1-50 



+ ^ = 1, 
n 



we can write 

Sn,lfl{z) 



An ^ ^2 1 [/i (^) - /i (^)] - ^ J i_gM 



(7.2) 



/l(z-^ + ^)-/l(z) 



n n 



n 



\ (n,k) I N 

^ _ mx^n \ g\ '{nj-a) 



n n 



h{z) 



{n,k) 



n 



+ AnE 



n n 



hiz) 



Zi 



n n 



fiiz) 



n 



n 



\ {n,k) I N 

^ _ mx,n \ 9\ '{nj-a) 



(n,fc) 
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By making use of Taylor's theorem, it can be easily verified that the following 
three relations holds: 



E 



fi[z + ^ 

n n 



fi(z + ^ 

n n 



f I £i . £k+i 
fi[z + 



■fiiz) 
■fliz) 
■fi{z) 



dzi dzj 



+ o(n 



n ^ dzi n dzi ' 



n ^ * dzi 



n n 

By means of the last three expressions, we can write (7.2) as 



A. 



Bn,ifi{z) = -^y^Zi{6 



dzi dzj 



+ <i/i(z)+o(A„n-2), 



2J 



where B^^ifi{z) is the drift term, given by 



An 



n 



akgi 
1-90 



mi 



n 



mi. 



n 



.n \ dfi 



1-50 



mi^n\ sr^ dfi 
n / ^ dz^ 



dzi 



Using (3.5) and Proposition 3.2, it can be seen that 



akgi 
1-50 



771,1 



n 



mi 



n 



akmi : 



and 



yielding 



1-90 



TTli 



n 



a 
n 



n 



Bn,ifi{z) = -^yZ^ii^' 



— 2^((as) + a) — + o(A„?i ^ 



n 



For / G C(Voo), define : C(Voo) ^ C(V„) to be 
(7.3) rn/ = /|v„, 
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namely, the restriction of / to V„,, and note that 

\\fnf-f\\^0, /GC(Voo). 

Recalhng that A„ = imphes that for / as in (5.10) and as in (1.2), 

||£(°^)''°/-e„,if„/||^0. 

The strong convergence of the corresponding semigroups on C(Voo), similar 
to (5.16), and the statement of the proposition now follow from an applica- 
tion of Theorems 1.6.1 and 4.2.11 in Ethier and Kurtz (1986), together with 
the relativization of the Skorohod topology to C^q ^^^^^{[0,oo)). □ 



APPENDIX 

Proof of Proposition 3.2. Consider first that Vn,k appearing in (2.5), in 
the case of generalized gamma processes, can be written [cf. Lijoi, Mena and 
Priinster (2007b)] 

Vn,k = ^ / x"exp --[(r + x)°-r"] (r + x)°^-"dx. 
r(n) Jo I a J 

Together with (2.3), this leads to writing 

(A.l) go{n, k) = Vn,k-in -ak)Vn+i,k ^ ^ _ _ 

where 

_ /o°°^"exp{-(a/a)[(r + x)° -r"]}(T + x)"*^-"-Mx 
Denote by f{x) the integrand of the denominator of w{n,k), so 



w{n,k) = J J f{x)dx. 

Since f{x) is unimodal, by means of the Laplace method one can approxi- 
mate f{x) with the kernel of a normal density with mean given by 

(A.2) X* = argmaxx"-^ exp| --[(r + x)" - r°] 1 (r + x)"^'"" 
x>o [ a J 

and variance given by - [f"{x)]-^\x=x*- It follows that 

fN{x*]\f)C{x*pj, —[f'j(j{x)] 



w{n, k) 



f[xl)C{xl,-[f"{x)]-%=^.) ' 



where /tv denotes the integrand of the numerator, x^ and x]-, the modes 
of the integrands of numerator and denominator, respectively, and C{x,y) 
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is the normalizing constant of a normal kernel with mean x and variance y, 
yielding 



From (A. 2), the mode x*^ is the only positive real root of the equation 

(A.4) (n - l)x-^ + {ak - n)(T + x)'^ - a{T + x)""^ = 0, 

which, for a ^ 1/2,1/3, involves finding roots of polynomials of degree 
greater than 4. When a = 1/2 we have 

Xjj - 



+ 



I2a? 3 

48aV(n - l){k - 2) + [4aV - (A: - 2)^]^ 
6 • 2^/'^a^pi^£){a, r, n. A:) 



^ 12-2V3a2 ' 
where 

pi,D(a,T,n,/c) 

= {P2,D + [pId + 4(-48aV(n - l)(fc - 2) - (4aV - (A; - 2)2)2)3]i/2|i/3 
and p2^D = P2,D{a, t, n, k) with 

P2,D(a,T,n,/c) = 2(/c - 2f[{k - 2f - 12aV(A; + 4 - 6n)] 

+ 96a^T2(A;(A: + 2) + 10 - 6n{k + 4) + ISn^) 

Similarly, one finds that 



128a^r^ 



where 



, _(fc-2)2_r 
12a2 3 

48aVn(A; - 2) + [Aa'^r - {k - 2ff 
6 • 2'^/'^a^pi^p^{a, r, n, A;) 

pi^N{a,T,n,k) 
12 • 21/3^2 ' 

Pi,N{a, T, n, A;) = \p2,N + (Ps.Tv)^^^]^''^ 



with 

P2,Af (o, T, n. A:) = P2,d(«, n, A;) - 144a^T[4a^r(A: + 1 - 6n) - (A: - 2)^] 
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and 

P3,N{a,T,n,k) 

= -3^2^°a%'^T^[{k - 2)^ - 2n{k - 2f 

- 4aV[8 + 2k'^ + 9n(3n + 4) - 2k{9n + 4)] + IGaV^]. 
When k ~ sn" [cf. (3.7) above] and a = 1/2, it can be checked that 
(A.5) n-V,i(a,T,n,/c) ^2^/3^^ i = N,D, 

from which 

(A.6) n-^x*^{s/2af, i = N,D. 

Using this fact, one finds that 

fix},) ^ l + r(5/2a)-2- 

Computing also the ratio of the two second derivatives, it can be seen that 
w{n,k) —7- 1, which by means of (A.l) and (2.6) imphes (3.8). In order to 
find the speed at which w{n, k) goes to 1, consider 



1 — win.k) = I f(x)dx/ / f(x)dx. 

Jo T + x'^ ^ / Jo 

The denominator is unchanged, while the mode of r/(r + x)f{x) is 
(A; -4)2 t 



t'* 



12a2 3 
-ASaHjn - l)(fc - 4) - [Aah - jk - 4)^]^ 
6 • 22/3a2p/^ £)(o5 f^i 

p[j^{a,T,n,k) 
^ 12 -21/302 ' 

where 

Pi,D{a,T,n,k) 

= {P2,D + [pId + 4(-48a2t(n - 1)(A; - 4) - {AaH - [k - ^ffff^f/^ 
and p2,D = P2,D{a, t, n, k) with 
p'2^jj{a,T,n,k) 

= 2{k - Af[{k - 4)3 - l2a^T{k + 2- 6n)] 

+ ma^T'^{k{k - 2) + 10 - 6n(fc + 2) + ISn^) - 64a^T3. 

Moreover, p'2 satisfies (A.5), and (A.6) follows. Unfortunately the fact that 
the two modes grow with an asymptotically equivalent rate is too rough an 
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approximation for our purposes here, which ignores how far apart they are 
if this is neghgible with respect to the growth speed. Indeed, it turns out 
that 

n~y\xh-xl,)^s/a\ 

Using this information in the Laplace approximation for w^n, k) — 1 yields 

n(l — w{n, k)) — ia^fr j s = (ij s 

with /3 as in the statement of the proposition. From (A.l) it is now easy to 
see that 

ngo{n, k) — akw{n, k) = n(l — w{n, k)), 
which provides the second order approximation for §0(12, k), 

go{n,k) = \ ho(n ), 

n n 

where the first term is of order n~^/^, yielding immediately, by means of 
(2.6), the second order term for gi(n,k), that is, 

gi{n,k) = 5- + o(n ). 



n 



Proof of Theorem 6.1. We can write the generator of (6.10) as 

(A.7) Bnifo X /l) = flBn,0f0 + foBn,lfl, 

where Bn,o and Bn,i drive Kn{-)/n" and w{X^'^\-)), respectively, and /o G 
&{Aq), with &{Ao) as in (4.10), /i G C^(V„) and is as in (7.1). Based 
on (4.1), while retaining 

771]^ 72 tenipora,rily, wg Ccin write 



: lim — <! 5t 

540 ot 



fo 

+ /o 



k + 1 



n 



mi n \ (n-l,k) 



k-l\mi^n (n-i,fc-i), . /, 1^^ 
' g\ [n — 1 — a[k — 1)) 



n 



mi^n \ {n-l,k) 

n n 



!!|£i^("-i.'=-i)(„_l_«(A:_l)) 



(l-<5t)/o(^)+0((5t)^)-/o(-- 



TV- 
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n 



, "il,n \ (n-l,k) 

1 Iffo 



fo 



k + 1 



+ ^5i"-'''=-')(n-l-a(/t-l)) 



n 



k-l 



fo 



An application of Taylor's theorem yields 



Bn,ofo 



k 



, mi n \ (n-l,fc) 

1 l9o 



n 



+ ^(7i"-'''=-')(n-l-a(fc-l)) 



n 



X 



/o 



k 



-f'(-\+^f"(- 



k 



, "T-l.n \ (n-l,fc) 

1 9o 

n / 



n 



1 .YA 



1 5'0 

71/ 



+ 0(n-^"). 
Prom (4.4) and (4.5) we have 
k 



n 



+ o(n ^) 



1 .'/^ ^ 



+ 0(n 



n 



k 
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from which it fohows that 

(A.8) ||A/o-^^'/X,o/o||^0 

as n — )• oo, with as in (4.9). Since (6.11) impHes Kn{0)/n"' =^ ^o, the 
previous expression, together with Theorems 1.6.1 and 4.2.5 in Ethier and 
Kurtz (1986), imphes (4.3) with C[o,oo)([05 oo)) replaced by Z)[o,oo)([0) co)), 
while (4.3) follows from relativization of the Skorohod topology to 
C[o,oo)([0,oo)). 

In order to describe i3„,^i in (A. 7), define 

(A.9) </.„(/i) = /i((/ii,/i),...,(/i„,/i)), /iGC2(M"),/iiGC(X), 

and 



(A.IO) ;,„(t) = iV5 

T) ^ ^ 



n '-^ ^ ' 

Then the generator of the ^(X)-valued process /U„(-) = {/.i„(t),t > 0} can 
be written 



n ^ ozi 



(A.ii) - ^ f]ffi"-^'%._i,4(Q("-^'^) - ^ 

1 r//,. A . ,.\_ih. ,.\ih. ,,\i 



+ - XI 91 [{hihj,ij)-{hi,ij){hj,ij)] 



n ^ — ' " " " ■ . - ' 

l<fc^i<n ■' 

where Qz^ is the derivative of g with respect to its ith argument and Q^""^'*) 
is defined as 

(A.12) g("-^'')5(^) = j 5(2/)p:-i,.(2,d2/), G Co(X), 

with 

(A.13) p;_^^(z,dy) = - Yl %(d2/). 

Unlabel now the model by choosing fj^ni^j) as in (A.9) with hj{-) being the 
indicator function of the jth largest atom in ^u, so that {hj,fj,) = zj is the 
relative frequency associated with the jth most abundant species. Note that 
some arguments of fi(zi, . . . , Zn) can be null since Kn-i,i < Kn < n. With 
this choice we have {hihj,fi) — {hi, ii){hj, fi) = Zi6ij — ZiZj, where 6ij is the 
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Knonecker delta, and, under (6.6) and (A. 12), 

n 

{Phi,^) - {hi,n) = '^pi{x*,dx*)zj - Zi 
i=i 

n 

i=i 

and 

n 

It follows that B„ reduces to 

^"'1-^1 = ^ Z^^o""^''^ ( ^[Pi{x*Ax*) - 6ij]zj J ^ 
i=l \i=i / * 

(A.14) - ^J29t'''^^n-i, f f;[K-i,H,dx:) - 6,,]z] ^ 



n 

i=i \j=i 
1 " 



n ^ — ' " " "■ dzi dzj 

i,j=i 



Here a mutation from type i to type j occurs at rate qij = qij{z) given by 
lij = -[5i"~^''-'bi(^,{i}) - Sij] 

(A.15) 

- agl"" ^''^A'„„i,iK_i,i(i, {j}) - Sij]], 

where pi{i,{j}) and stand for pi{x*,dx*) and dx*). 

When (6.7) holds, from the nonatomicity of vq we have pi{z,dy) = for 
every y G X, and when (A. 13) holds, we have ^{z,dy) = K~\ ■, from 
which 

%i = - [-(>ij9o -agl ' + ag\ 'Kn-i,idij\ 



1 B/s 
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where the second equahty follows from (2.6) and the third from (3.9). Once 
again it is clear that the key point for determining the limiting behavior of 
the diffusion is the second order approximation of the predictive weights, as 
obtained in Proposition 3.2. Hence, we have 



n 



i=l j=l 



E 

i=l 



Zij3/s 



+ 



a 



+ o{n 



n{n — 1) n{n — 1) 
from which (A. 14), substituting (3.9) in the third term, reduces to 

= > ] [n"''' - 0{n~^)]zi{6ij - 



n 

E 



dzi dzj 



E 



+ 



a 



n{n — 1) ' n[n — 1) 



which in turn implies that 
(A.17) \\Aih 



{r?/2)Bn,ifnh 



+ o{n 







dzi ' 



with ^1 as in (5.9), /i G Si{Ai) as in (5.10) and f„ as in (7.3). From (A.8) 
and (A.17) it follows that 

P(/0 X /l) - {rnhW^Bnflh - hn^Bn,lfnh/n ^ 

with A as in (1.4) and (/o x /i) as in (5.11). The fact that (6.12) holds in 
Dp ^-j^y^ ([0, oo)) follows from the density of (5.11) in Co([0,oo) x Vqo), 

together with Theorems 1.6.1 and 4.2.11 in Ethier and Kurtz (1986), which 
imply, respectively, the strong convergence of the associated semigroups on 
Co([0, oo) X Voo), similarly to (5.16), and the weak convergence of the prob- 
ability measures induced on ^^^^^ ([0, oo)). The same assertion with 

CjQ oo)xVoo ([^' °°)) place of -DjQ oo)xv<x>([^' °°)) follows from relativiza- 
tion of the Skorohod topology. 
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